Auto-Weighted Multi-View Clustering for Large-Scale Data
نویسندگان
چکیده
Multi-view clustering has gained broad attention owing to its capacity exploit complementary information across multiple data views. Although existing methods demonstrate delightful performance, most of them are high time complexity and cannot handle large-scale data. Matrix factorization-based models a representative solving this problem. However, they assume that the views share dimension-fixed consensus coefficient matrix view-specific base matrices, limiting their representability. Moreover, series algorithms bear one or more hyperparameters impractical in real-world applications. To address two issues, we propose an auto-weighted multi-view (AWMVC) algorithm. Specifically, AWMVC first learns matrices from corresponding different dimensions, then fuses obtain optimal matrix. By mapping original features into distinctive low-dimensional spaces, can attain comprehensive knowledge, thus obtaining better results. design six-step alternative optimization algorithm proven be convergent theoretically. Also, shows excellent performance on various benchmark datasets compared with ones. The code is publicly available at https://github.com/wanxinhang/AAAI-2023-AWMVC.
منابع مشابه
Guided Co-training for Large-Scale Multi-View Spectral Clustering
In many real-world applications, we have access to multiple views of the data, each of which characterizes the data from a distinct aspect. Several previous algorithms have demonstrated that one can achieve better clustering accuracy by integrating information from all views appropriately than using only an individual view. Owing to the effectiveness of spectral clustering, many multi-view clus...
متن کاملLarge-Scale Multi-View Spectral Clustering via Bipartite Graph
In this paper, we address the problem of large-scale multi-view spectral clustering. In many real-world applications, data can be represented in various heterogeneous features or views. Different views often provide different aspects of information that are complementary to each other. Several previous methods of clustering have demonstrated that better accuracy can be achieved using integrated...
متن کاملWeighted Multi-view Clustering with Feature Selection
In recent years, combining multiple sources or views of datasets for data clustering has been a popular practice for improving clustering accuracy. As different views are different representations of the same set of instances, we can simultaneously use information from multiple views to improve the clustering results generated by the limited information from a single view. Previous studies main...
متن کاملRobust auto-weighted multi-view subspace clustering with common subspace representation matrix
In many computer vision and machine learning applications, the data sets distribute on certain low-dimensional subspaces. Subspace clustering is a powerful technology to find the underlying subspaces and cluster data points correctly. However, traditional subspace clustering methods can only be applied on data from one source, and how to extend these methods and enable the extensions to combine...
متن کاملA partition-based algorithm for clustering large-scale software systems
Clustering techniques are used to extract the structure of software for understanding, maintaining, and refactoring. In the literature, most of the proposed approaches for software clustering are divided into hierarchical algorithms and search-based techniques. In the former, clustering is a process of merging (splitting) similar (non-similar) clusters. These techniques suffered from the drawba...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2023
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v37i8.26201